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PROBE 

The present invention relates to DNA probes for 
detecting a tandemly-repea ted nucleotide sequence in the 
gene encoding mucin glycoprotein expressed by human 
mammary epithelial cells, to the use of the probe in 
diagnosis and in "fingerprinting" individuals, to the 
polypeptides expressed by the corresponding mucin gene, to 
antibodies against the polypeptides and to the use of the 
polypeptides and antibodies in the diagnosis and 
therapeutic treatment of cancer. 

Normal and malignant human mammary epithelial, 
cells express high molecular weight glycoproteins (gps) 
which are extensively glycosylated and very antigenic. As 
a result, many of the monoclonal antibodies (MAbs) selected 
for reactivity with human breast cancer and other 
carcinomas are found to react with molecules which are 
produced in abundance by the fully differentiated human 
mammary tissue and are found in the milk fat globule (MFG) 
and in milk. However, the level of expression of a. 
particular antigenic determinant may be different in the 
gps produced by the normal differentiated cell and in the 
similar molecules produced by breast cancers. This means 
that some antibodies can show a certain specificity for 
reacting with tumour gps. 



WO 88/05054 PCT/GB88/0001 1 

-2- 



The molecules bearing the epitopes recognised by 
these antibodies are complex and have been difficult to 
analyse, both because they are large and heavily 
glycosylated (>250 / 000 daltons) and because of the complex 
pattern of expression. Two of the MAbs , HMFG-1 and -2, 
react with a component in human milk which appears to be 
greater than 40.0,000 daltons, whereas the molecules found 
in sera and tumours are smaller, although the dominant 
components are still greater than 200,000 daltons on 
immunoblots. The large glycoprotein produced by the 
differentiated mammary epithelial cells found in human milk 
or in the milk fat globule has been purified and shown to 
have some of the characteristics of the mucins. This 
component contains a large amount of carbohydrate joined in 
O-linkage to serine and threonine residues via the linkage 
sugar N-ace ty lgalactosamine . Moreover, the core protein 
contains high levels of serine, threonine and proline and 
low levels of aromatic and sulphur containing amino acids. 

These mucin-like glycoproteins are also secreted 
by a number of other normal epithelial cells. The 
monoclonal antibody HMFG-1 is highly reactive with the milk 
mucin and evidence suggests that the epitope recognised by 
this antibody is more abundant on the fully processed 
mucin, characteristic of normal differentiation. 

In tumours, the molecular weight of the molecules 
carrying these antigenic determinants differs among 
individual tumours and, in the case of the components 
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recognised by the HMFG-2 antibody, can range from 80-400K- 
daltons. Although it appears that the differences observed 
.in the mobility of the high molecular weight bands are due 
to genetic polymorphism this probably does not explain 
variations in the size of the lower bands. It has been 
proposed that these may be the result of aberrant 
processing occurring in the tumour cell possibly within the 
glycosylat ion pathways. 

For the majority of the monoclonal antibodies 
reacting with this group of molecules the exact nature of 
the antigenic epitopes remains unclear but circumstantial 
evidence has suggested tha t. carbohydrate may at least be 
partly involved in many of the epitopes. Moreover, from 
previously available data it was not known whether the 
mucin found in the normal differentiated cells, and that 
observed in the tumours, contain the same core protein, or 
just carry common carbohydrate determinants. 

Mucin has now been isolated from human milk by 
affinity chromatography enabling identification of the core 
protein and the gene encoding the protein. This has been 
found to be a highly polymorphic gene defined by the peanut 
urinary mucin (PUM) locus [see Swallow e_t al . , Disease 
Markers , 4, 247, (1986) and Nature , 327 , 82-84 (1987)]. 
The gene product, which is hereafter referred to as human 
polymorphic epithelial mucin or HPEM, has been detected in 
breast tumours and other carcinomas as well as in some 
normal epithelial tissues. 
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It has now been found that the HPEM core protein 
has epitopes which also appear in the aberrantly processed 
gps produced by adenocarcinoma cells. Certain of these 
epitopes are not exposed in the fully processed mucin 
glycoprotein produced by the lactating mammary gland. 

In one aspect the present invention therefore 
provides an antibody against a human mucin core protein ■ • 
which antibody substantially does not react with a fully 
processed human mucin glycoprotein. 

As used herein the term "antibody" is intended to 
include fragments of antibodies bearing antigen binding 
sites such as the F(ab') 2 fragments. 

Antibodies according to the present invention 
react with HPEM core protein, especially as expressed by 
colon, lung, ovary and particularly breast carcinomas, but 
have reduced or no reaction with the corresponding fully 
processed HPEM. In a particular aspect the antibodies 
react with HPEM core protein but not with fully processed 
HPEM glycoprotein as produced by the normal lactating 'human 
mammary gland. 

Antibodies according to the. present invention 
have no significant reaction with the mucin glycoproteins 
produced by pregnant or lactating mammary epithelial 
tissues but react with the mucin proteins expressed by 
mammary epi thelia 1 " adenocarc inoma cells. These antibodies 
show a much reduced reaction with benign breast tumours and 
are therefore useful in the diagnosis and localisation of 



breast cancer as well as in therapeutic methods. 

The antibodies may be used for other purposes 
including screening cell cultures for the polypeptide 
expression product of the human mammary epithelial mucin 
gene f or fragments thereof, particularly the nascent 
expression product. In this case the antibodies may 
conveniently be polyclonal or monoclonal antibodies. 

Antibodies according to the present invention may 
be produced by innoculation of suitable animals with HPEM 
core protein or a fragment thereof such as the peptides 
described below. Monoclonal antibodies are produced by the 
method of Kohler & Milstein (Nature 256 , 495-497/1975) by 
immortalising spleen cells from an animal innoculated with 
the mucin core protein or a fragment thereof, usually by 
fusion with an immortal cell line (preferably a myeloma 
cell line) , of the same or a different species as the 
innoculated. animal , followed by the appropriate cloning and 
screening steps. 

In a particular aspect the present invention 
provides the monoclonal antibodies designated SM3 against 
the HPEM core protein. In another aspect the invention 
provides the hybridoma cell line which secretes the 
antibodies SM3 and has been designated HSM3. Samples of 
HSM3 have been deposited with ECACC on 7th January 1987 
under accession number 87010701. 

Using antibodies according to the invention it 
has been possible to screen a phage library constructed 
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• from mRNA isolated from a human breast cancer cell line to 
identify sequences coding for portions of the mucin core 
protein. Complementary DNA sequences have been constructed 
and from these it has surprisingly been found that the gene 
encoding the core protein contains multiple tandem repeats 
of a 60 base sequence leading to considerable polymorphism 
sufficiently extensive that cDNA fragments corresponding to 
the repeat sequence would be useful for fingerprinting DNA. 
The fingerprinting thus made possible has applications in 
for instance ascertaining whether bone marrow growth after 
transplants is from the host or the donor and in forensic 
medicine for identifying individuals using body tissues or 
fluids. 

Accordingly the present invention also provides a 
nucleic acid fragment comprising at least 17 nucleotide 
bases the fragment being hybridisable with at least one of 
a) the DNA sequence 
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ACC GTG GGC TGG GGG GGC GGT GGA GCC CGG- 
GGC CGG CCT GGT GTC CGG GGC CGA GGT GAC- 
ACC GTG GGC TGG GGG GGC GGT GGA GCC CGG- 
GGC CGG CCT GGT GTC CGG GGC CGA GGT GAC ' 
b) DNA f complementary to the DNA of a) , i.e. of sequence 
GTC ACC TCG GCC CCG GAC ACC AGG CCG GCC- 
CCG GGC TCC ACC GCC CCC CCA GCC CAC GGT— 
GTC ACC TCG GCC CCG GAC ACC AGG CCG GCC- 
CCG GGC TCC ACC GCC CCC CCA GCC CAC GGT ' 
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c) RNA having a sequence corresponding to the DNA sequence 
of a) and 

d) RNA having a sequence corresponding to the complementary 
DNA sequence of b). 

The sequences in (a) and (b) each include a double 
tandem repeat sequence of 120 bases. Fragments according to 
the invention may correspond to any portion of this sequence 
including portions bridging the start point of the repeat. 

Fragments according to the invention will 
hybridise under conditions of low stringency with the DNA 
and RNA sequences (a, to (d) above. Preferred fragments are 
those which also hybridise under conditions of high 
stringency. The most preferred fragments of the invention 
are those which have sequences exactly identical to, or 
exactly, complementary to the sequences (a) to (d) above. 

Normally a given DNA or RNA fragment according to 
the invention will be capable of hybridising with both DNA 
according to a) and RNA according to c) or with both DNA 
according to b) and RNA according to d) above. 

Preferably the nucleic acid fragment according to 
the present invention will co, -rise a portion of at least 30 
nucleotide bases capable of hybridising with at least one of 

a) to d) above, more preferably at least 50 such bases and 
most preferably the fragment contains a sequence of 60 bases 
exactly complementary to one of the repeat sequences of a,, 

b) O or d) above. Other fragments of the invention may. 

comprise two or more repeats of ,„,h a - 

epeacs of such a sequence, optionally 
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with minor variations by way of substitution. Preferably 
such fragments include an integral number of such repeat 
sequences. Further fragments of the invention comprise the 
tandem repeat sequence and additional coding or non-coding 
5* and/or 3 1 flanking sequences corresponding to the HPEM 
gene or a portion thereof. 

When the existence of a tandem repeat sequence was 
first identified it was believed that the sequence consisted 
of 59 base pairs corresponding with the sequences indicated 
in (a) and (b) above except for the lack cf the base 
indicated with "* rt . 

Many fragments according to the invention as 
originally defined in British Patent Application No. 8700269 
also conform with the new definition of fragments as set out 
herein and those fragments of sequences defined under (a) , 
(b) , (c) or (d) above which do not include the bases marked 

form a particular aspect of the present invention. Such 
fragments have sequences corresponding to at least a portion 
of the sequences - 
a 1 ) GTG GGC TGG GGG GGC GGT GGA GGC 

a 1 ') CGG GGC CGG CCT GGT GTC CGG GGC CGA GGT GAC AC 

b f ) DNA complementary to the sequence of a') or a 1 '), 

c f ) RNA having a sequence corresponding to the sequence of 

a') or a") and 
d 1 ) RNA having a sequence corresponding to one of the 

complementary DNA sequences of b 1 ) 

In the human genome the DNA . tandem repeat sequence 



comprises antiparallel double stranded DNA, one strand 
.having sequence (a) and being paired with a strand having 
seguence (b) . 

As mentioned above the nucleic acid fragments of 
the invention may be used as a probe for detecting one or 
other strand of the DNA tandem repeat sequence in the human 
genome, or RNA transcribed from either strand and hence for 
identifying the gene or genes for human mucin core proteins, 
mRNA transcribed therefrom and complementary DNA and RNA. 
For such purposes it may be convenient to use the complete 
normal gene comprising at least one tandem repeat sequence, 
or mRNA transcribed therefrom or to attach non-complementary 
fragments to either or both the 5' and 3' ends of a fragment 
according to the invention and/or to attach detectable 
labels (such as radioisotopes, fluorescent or enzyme labels) 
to the probe or to bind the probe to a solid support. All 
of these may be achieved by conventional methods and the 
nucleic acid fragments of the invention may be produced de 
novo by conventional nucleic acid synthesis techniques. 

The nucleic acid fragments of the present 
invention may also be used in active immunisation 
techniques. In such methods the fragment codes for a 
polypeptide chain substantially identical to a portion of 
the mucin core protein and may be extended at either or both 
the 5' and 3' ends with further coding or non-coding nucleic 
acid sequences including regulatory and promoter sequences, 
marker sequences and splicing or ligating sites. Coding 
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sequences may code for corresponding portions of the mucin 
core protein chain or for other polypeptide chains. The 
fragment according to the invention, together with any 
necessary or desirable flanking sequences is inserted, in an 
appropriate open reading frame register ,. into a suitable 
vector such as a plasmid or a viral genome (for instance 
vaccinia virus genome) and is then expressed as a 
polypeptide product by conventional techniques. In one 
aspect the polypeptide product may be produced by culturing 
appropriate cells transformed with a vector, harvested and 
used as an immunogen to induce active immunity against the 
mucin core protein. In another aspect the vector, 
particularly in the form of a virus, may be directly 
innoculated into a human or animal to be immunised. The 
vector then directs expression of the polypeptide in vivo 
and this in turn serves as an immunogen to induce active 
immunity against the mucin core protein. 

The invention therefore provides nucleic acid 
fragments as hereinbefore defined for use in methods of 
treatment of the human or animal body by surgery or therapy 
and in diagnostic methods practised on the human or animal 
body. The invention also provides such methods for • 
treatment of the human or animal body by surgery or therapy 
and diagnostic methods practised In vivo as well as e>c vivo 
and ij^ vitro . 

The invention further provides a polypeptide 
comprising a series of residues encoded by the DNA tandem 
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. re p" ea t sequencer the sequence shown at (b) above being the 
coding sequence. Polypeptides according to the invention 
are selected from any of those having 5 or more amino acid 
residues represented by the iollowing amino acid sequence 

Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr 
Ala Pro Pro Ala His Gly*Val Thr Ser Ala Pro Asp Thr Arg 
Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly 
("*" marks the start of the repeat in the peptide). 

Polypeptides according to the invention may have 
a sequence corresponding with any portion of the 40 
residue sequence above and may include the start point of 
the repeat sequence. 

Other polypeptides according to the invention 
include three or more repeats of the 20 amino, acid repeat 
sequence. Such polypeptides may include minor variations 
by way of substitution of individual amino acid residues. 

The invention further provides polypeptides as 
defined above modified by addition of N-acetyl 
galactosamine (a linkage sugar) on serine and/or threonine 
residues and by addition of oligosaccharide moieties to 
that or via other linkage sugars and/or fragments linked 
to carrier proteins such as keyhole limpet haemocyanin, 
albumen or thyr oglobul in . 

Preferably the polypeptide comprises at least 10 
amino acid residues of the sequence above, more preferably 
20 residues. The polypeptide may comprise the full 
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sequence above. Such polypeptides may further comprise 
additional amino acid residues, preferably conforming to 
the amino acid sequence of HPEM core protein. 

In a further aspect the present invention 
provides the HPEM core protein. This is encoded by the 
PUM gene and may be produced by recombinant DNA techniques 
and expressed without glycosylat ion in human or non-human 
cells. Alternatively it may be obtained by stripping 
carbohydrate from native human mucin glycoprotein which 
itself may be produced by isolation from samples of human 
tissue or body fluids or by expression and full processing 
in a human cell line. The HPEM core protein may be used 
for raising antibodies in animals for use in passive 
immunisation, diagnostic tests and tumour localisation and 
in active immunisation of humans. 

The invention further provides antibodies 
(monoclonal or polyclonal) , and fragments thereof, against 
any of the polypeptides described above. Such antibodies 
may be obtained by conventional methods and are useful in 
diagnostic and therapeutic applications. 

The invention further provides antibodies 
(monoclonal or polyclonal), or fragments thereof, linked 
to therapeutically or diagnos t ically effective ligands. 
For therapeutic use of the antibodies the ligands are 
lethal agents to be delivered to cancerous breast or other 
tissue in order to incapacitate or kill transformed cells. 
Lethal agents include toxins, radioisotopes and 'direct 
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• killing agents 1 such as components of complement as well 
as cytotoxic or other drugs. Further therapeutic uses of 
the antibodies inclusive passive immunisation. 

The invention further provides therapeutic 
methods comprising the administration of effective 
non-toxic amounts of such antibodies or fragments thereof 
and antibodies or fragments thereof for use in therapeutic 
treatment of the human or animal body. Especially in 
therapeutic applications it may be appropriate^ to modify 
the antibody by coupling the Fab region thereof to the Fc 
region of antibodies derived from the species to be 
treated (e.g. such that the Fab region of mouse monoclonal 
antibodies may be administered with a human Fc region to 
avoid immune response by a human patient) or in order to 
vary the isotype of the antibody. 

In the diagnostic field the antibodies may be 
linked to ligands such as solid supports and detectable 
labels such as enzyme labels, chromophores and 
fluorophores as well as radioisotopes and other directly 
or indirectly detectable labels. Preferably monoclonal 
antibodies or fragments thereof are used in diagnosis. 

The invention further provides a diagnostic 
assay method comprising contacting a sample suspected to 
contain abnormal human mucin glycoproteins with an 
antibody as defined above. Such methods include tumour 
localisation involving administration to the patient of 
the antibody or fragment thereof bearing a detectable 
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label or of an antibody or fragment thereof and, 
separately simultaneously or sequentially in either order 
a labelling entity capable of selectively binding the 
antibody or fragment thereof. The invention also provides 
antibodies or fragments thereof for use in diagnostic 
methods practised on the human or animal body. 

Particular uses of the antibodies include 
diagnostic assays for detecting and/or assessing the 
severity of breast, ovary and lung cancers. 

Diagnostic test kits are provided for use in 
diagnostic assays and comprise antibody or a fragment 
thereof, optionally suitable labels and other reagents 
and, especially for use in competitive assays, standard 
sera. 

The invention will now be illustrated by the 
following Examples and with reference to the figures of 
the accompanying drawings in which 



firvrt Legends 

yi F ure 1; P"J-i«c»tion cf the milk «ucia by i=uno affinity 
hrcaatorraphy naing'the matibedy HL7G-1. Milka froa .ertral 
iadiridual. «re coabiaed aad ab.orbed to a HkTG-l-Sepbaro.e column aa 
d.acrib.d 1b Method,, ft. material .l ntia , at lo, pH vae iodiaated 
and subjected to PAGI el.etrcpbor.ai. aad autoradiography (track 1). 
»• iodiaated material v« precipitated „ing the Frot.ia A method 
vita aatibodiee HV£?G-1 (track 5), K^G-2 (track 2), ET 25 4 (track 3) 
aad RB£i 4 20* res (tra=k 4). 

ri ^ re 2 ; C°=P"i«" of the 125 J labelled p.rified milk aucia vitb 
iaauaoblot cf teaa aki=ed ailk. a, huaan .kiaaed ailk rae aubjected 
to EDS polyacrylaaide electrophoreaia . transferred to aitrocellulcee' 
paper, the blot probed with the aoaecloaal aatibody. JflOO-1 aad biadiag 
detected uaiag aa ZL2SA method. B, after purification ca u H1£?G-1 
affinity colaaa followed by G75 Sephadex chroaatograpby the rilk aucin 
»as iodiaated by the Bolton mad Hunter aethod aad aubjected to EDS 
polyacrylaaide •lectrophoreaia aad autoradiography . 

Zi£HIi_S: autoradiography cf the iodiaated milk aucia after treatment 
vita hydrogen fluoride. The purified milk mucin vaa treated »ith KT 
for 5 houra at rcoa tarparature (track 1) or 1 hear at 4*C (track 2) 
*nd the vaulting preparatioaa vara thea iodiaated mad ma oa EDS 
polyacrylaaida gals. 
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Tifm 4 ; «»"tiTity of the intact, partially .tripped or extensively 
.tripped .Ilk amci* rtth iodinat.d lectins. II, furltitd intact B ii k 
»ucin (track 1). tie eucia treated with K7" for 1 Lour «t 4'C (track 2) 
and the »ucin treated for S hour, at roon temperature (traok S) were 
.ubjected to SOS polyacrylanide electrophoresis and then tranaferred 
to nitrocellulose paper. Ihe paper was then probed with 125 I PKA 
(peanut agglutinin), ™h r CA (wheat gem agglutinin), or 125 J EPA 
(Helix pooatia agglutinin). 

21rure_5: I=uncprecipitation and i^rur.ohlets of the partially and 
extensively .tripped rucin. A, the 125 I extensively .tripped tt „cin 
was ix=unoprecipitsted with SK-3 (track 3), Kv-C-2 (track 2) or K32 
rediun a. a control (track 1) by the protein/ plate »ethod (.ee 
Material, and Methods). B, the partially .tripped suein (track 1) or 
extensively .tripped nucin (track 2) was run on SDS polyacrylanide 
Eels and transferred to nitrocellulose paper. The blot was then 
reacted with a cocktail of SK-3 and S¥-4 monoclonal antibodies and the 
binding detected using an ZLISA eethod. 

Zi£Hre__6: Raactivity of nonoclonal antibodies SK-3 and HXCFG-2 with 
aethacara fixed breast tia.ue and tunour .ection. using an indirect 
i=unop«roxidase staining aethod. Infiltrating ductal' carcinona 
«ho»Ing .trong reactivity with both S1/-3 (1) and EXTG-2 (B) 
Tibroadenoaa ahowing no reactiTity with SV-3 (C) and .trong 
ieterojeaeou. .tainlng of the .pitbeliun with (D). Fapillona 

•bowing Tery weak re.ctirity with SX-3 (Z) and strong positivity with 
K^C-2 (7). Both nomal resting breast (G) and lactating breast (I) 
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«r. n. Mtl „ wh.n .t.in.d »ith SU. S . » W „. both 

po. I tir. 1 , with ith tauth, br ... t (J) Mci , treoier ^ 

aornal reatin* breaat (H) . 
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Figure Legends 

figure 7. Periodic acid-silver stained milk mucin after antibody 
affinity coluaa and gel filtration column. Milk mucin was purified on 
an antibody affinity column (lane 1) followed by passage 

through a G75 Sephadex column (lane 2} f subjected to 
KaDodSO^/polyacrylamide gel electrophoresis, and silver atained 
following treatment of gels with 0.2JJ periodic acid. 

Figure Silver atain of partially and totally stripped core 

protein from milk mucin. The purified milk mucin was deglycosylated 
by treatment with anhydrous hydrogen fluoride for 1 hr at 0*C (lane 1) 
and 3 hr at room temperature (lane 2) , aeparated by electrophoresis 
through a NaDodSO^/polyacrylamide gel (10%) and silver stained. 

Figure 9 . Immunoprecipitation with MAbs of in vitro translated 
protein products from HCF-7 poly(A) + RNA. Foly(A)* RNA from MCF-7 
cells was translated in a rabbit reticulocyte lysate system (Amershan) 
in the presence of [ 35 S]methionine .(1000 Ci/xnaole; ICi «= 37 GBq) 
following the manufacturer's conditions. Samples containing 5x10* 
acid precipitable cpm were precipitated with KAba SW-4 (lane a), SU-3 
(lane b), HU7G-2 (lane c) t M£FG-1 (lane d) and an irrelevant KAb to 
interferon (lane e, 24 ) t separated on a XaDodSO^/polyaerylamide gel 
(10%), impregnated with Amplify and exposed to IAJU5 film at -70*C for 
20 dayB. 



SUBSTITUTE SHEET 



... $53? \:=$*f — >: ' — 

) ') 

" 19 

Figure 10. laauaoblot analysis of fuBion proteinB froa the Xsuc 
clones. The phage clones xifUC 5,4,6,7,8,9 and 10 were used to 
lysogenize bacterial strain T 1089. Imogens were grown at 32 *C, 
shifted to 42 'C, and then induced with IPTG. Lysogen proteins were 
fractionated by electrophoresis through a NaDodSO^polyacrylaaide gel 
(7.5S), tranaferred to nitrocellulose , and reached with HU7G-2. The 
binding was detected with an ELISA nethod using 4-chloro-1 -oaphthol as 
the substrate. The numbers are those of the X clones. 

•Figure 11. .li/bridization of plfUCIO to cDKA insertB of pl£3C clones. 
DKA froa the plasnid clones was digested with restriction enryne EcoRI 
to excise the cDNA inserts, separated by electrophoresis on 1.4£ 
agarose and transferred to Biodyne nylon awabrane. The filter was - 
hybridized using standard conditions (34) to the insert from plfUCIO 
which was labelled with fa- S2 p]dCTP by the method of randoa priaing 
(41). Lanes: plaBaid clones 3,4,6,7,8,9,10. 

Figure 12. RNA blot hybridization analysis of aaaaary breast eucin 
aRKA. 10. pg of total RKA froa huaan breast cancer cells MC7-7 (lane 
1) and T4TO (lane 2), noraal huaan aaaaary epithelial cells h»ut (lane 
3), huaan embryonic fibroblasts ICR? 23 (lane 4), Daudi cells (lane 5) 
and carcinosarcoaa HS578T cells (lane 6) were separated in a 1.3% 
agarose/glyoxal gel, blotted on to nitrocellulose and hybridized to 
the pynC10 EcoRI insert which was labelled with [c-^pJdCTP by the 
■ethod of randoa priaing (41). The size Barkers were 28S (5.4 kb) and 
18S (2.1kb) rRNAs. 
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Pi pure J3. Polymorphic huaan DKA fragments detected by 
hybridi ration with pUUCl 0 probe. Genoaic. DKA saaples. prepared froa 
the white blood cells froa ten individual (six unrelated) and froa 
three cell lines were digested to completion with Hinfl and ZcoM, 
fractionated by slectrphoresis through 0.7* and 0.6* agarose, 
respectively, and transf erred to Biodyne nylon .eabranes. The filter 
was hybridised to the pWJClO PNA insert which was labelled with 
ra p]dCTP by the nethod. of random priming (41). X- ray tila nB 
•xposed for 1 day at -70'C with intensifying screens. Lan B8 1-4 
father, two daughters and mother, lanes 5-10 unrelated individuals, 
lane 11 is MCF-7, lane 12 is ZR75-1, lane 13 is. 3CR?-23. The DKA 
staples exhibit a wide distribution of sizes. Kuabers indicate length 
of DKA in kb. The apparent bands at 231b are in lanes 12 and 13 are 
artefacts introduced in autoradiography 
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Example 1 

Purlficetlen of the ellk nueln 

The cilk cucin was reified fron kuaan akianed ailk by passage 
through an affinity colura followed by size axcluBioa 

chroaatograpby. The Kk£7G-1 ssoaocloaal antibody was purified froa 
tissue culture supernatant using a protein A eolurui (1). The purified 
antibody was coupled to eyaaogea broside activated sephtrcee 
(Phamacia) aa deBcribed ia the raaufeeturer »s iaEtruction 8 . Human 
Btisned cilk -ras ptBBed in batches of 100 ttl through the antibody 
ce3a=n followed by extensive ashing with P2S.' Bound tctitca was 
elated fro= the column nEiug 0.1 i£ glyciae pK 2.5 and the- fractions 
registering tn optical deasity at 2B0aa were pooled, dialysed against 
0:25 it acetic acid aad lyophilixed. Bitches of about 20 cgs were 
dissolved ia 0.25 V acetic acid and passed through a G75 Sephadex 
coluaui (1 x 100 en) which had beoa previously equiligrated with acetic 
acid. The eoluna was washed with 0.25 V acetic acid and id fractioas 
collected. The peak fractions which were eluted ia the void voiuae 
were pooled, lyophilixed and the dry powder stored at 4'C. Aaino acid 
analysis was perforaed using a Beckaaa 6300 aaiao acid aaalyser. 

PcglTCOB ylation of the nilk eucia 

To rraove the O-linked carbohydrate froo the aiilk arucia the 

siolecule -was treated with anhydrous hydrogen fluoride as described by 
*ort and I*nport (22), for either 1 hour at 4'C which produced a 
partially stripped preparatioo, or 3 hours at roon tecperature which 
produced the exteasively stripped aueia. 
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lodination of the cilk cucin 

lodinaticns of the purified mucin, the partially or extensively 
■tripped mucin were carried cut using the Bolton and Hunter method 
(51). Briefly, the mucin, 2.5 W la 20 |d 0.1H borate buffer pH 8.5, 
was added to the dried Bolton and Hunter reagent (1 .Ci, amersham 
International pic) and incubated at rooo temperature for 15 minute* . 
The reaction was .topped by the addition of 0.5 ml of 0.2M glycine in 
borate buffer and after a further 15 minutes incubation, free Bolton 
and Hunter reagent was removed by passage through a 025 Sephadex 
column (PD10 columns, Pharmacia) previously equilibriated in PBS. 

lodination of Lectins 

Wheat germ agglutinin (WGA), peanut agglutinin (PNA) (Factor Labs) 
and Helix pomatia agglutinin (HPA) (Boehringer) were iodinated as 
described by Zarlsson et al. (52) using the chloramine T method. 

Polvac rvlanide eels and Western biota 

Polyacrylamide gel electrophoresis and immunoblotting was 
performed as described previously (1). Briefly, samples were run on 
5-1 5% polyacrylamide gels and then slectrophoretically transferred to 
nitrocellulose paper (Schleicher and Schuell) at 50 Yolta overnight 
*t 4*C (36). In the immunoblotting experiments the paper waa reacted 
with monoclonal antibodies and binding detected with an XLISa method 
wing 4-chloro-1-naphthol as the substrate. For lectin binding 
studies the Western blots were reacted with the iodinsted lectins as 
described by Swallow st al. £8). 
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Production of monoclonal entibodiea 

A feaale BALB/c souse was ianunised with 5 »»g of the partially 
• tripped milk aucin in Preund's coaplete adjuvant and 3 aonths later 
boosted with a further 5 *g of the sane preparation in Prsund's 
iaeoaplste adjuvant. After a further 20 days, 5 ug of the aucin 
•xtensively stripped of ite carbohydrate wai given intravenously in 
■aline solution. She spleen was renoved 4 days later, and fused with 
the WS2 aonee nyeloaa cell line (53). 

Screening of hvbrldo=a Bupematar.t and insunonr-eciMtationfl 

The screening assay was a nodification- of that described by Melero 
and Gonxalez-Rodrigue* (54>. Multiwell platee were coated with 50 »il 
of 0.1 ag/al protein A (Pharaacia fine Cheaicals) in PBS and allowed 
to dry overnight at 37'C. Ihe plates were blocked with 5% BSA for 1 
hour. »t SVC followed by the addition of 50 yl of rabbit anti-aouse 
iaaunoglobulin (DAIO. diluted 1:10 in PS S /BSA « PBS/BSA). After 
incubating for 2 hours at 37'C the plates were washed twice with PBS 
containing 1* BSA and 50 cf hybridoaa supernatant added. The 
plates were incubated overnight at 4*C, washed twice with PBS/SSA and 
50 nl of iodinated partially stripped aucin containing 100,000 cpa 
added to saeh well, the plates were then incubated at rooa 
teaperaturs for 4 hours, washed 4 tiaes with PBS/BSA and the 
individual wells counted in a gaaaa counter. Por iaaunoprecipitation 
•xperiaents 50 i»l of SDS saapl. buffer containing dithiothrsitol was 
sdded to sach of the wells which were then boiled for 3 ainutes and 
the buffer loaded onto 5-15* polyacryl aside gradient gela. 
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Staining of tissue aectiona 

Tissues fron priaary masaary carcinomas, benign brsa.it biopsies, 
normal breast, and pregnant lactating brsast tissue were fixed la 
aethacara (methanol chloroform and acetic acid 60:30:10) and embedded 
into paraffin wax. Bectiona were atained with the antibodies using 
the indirect peroxidase anti peroxidase method aa previously described 
(47). 

Results 

Purification of the milk eucin 

The ailk mucin was purified from human akimmed milk on an Hk£FG-1 
antibody affinity colusn. Iodination of the eluted material revealed 
the presence of a large molecular weight component and a 68ED band. 
Precipitation of the affinity purified material with antibodies H£?G-1 
and 107G-2 (tracks 2 and 5) followed by gel electrophoresis showed 
that both the high molecular weight components and the 68ED component 
were precipitated by both antibodies (less effectively by HlffG-2). 
Since the 68ZD coaponent was also precipitated by two unrelated 
antibodies (figure 1, tracks S and 4) and this coaponent was not 
wrident cn an immunoblot of the purified material reacted with KVC7G-1 
(figure 2A), the 681 coaponent was removed by molecular aitve 
chromatography on a 075 column. The final piirified product showed a 
major high molecular weight band, with only a trac e of the 68X 
coaponent and a minor contaminant around 141 (figure £B). 



SUBSTITUTE SHEET 



WO 88/05054 '*p$ . "f0 PCT/GB88/00011 

) - 25- ) 



A hi E h molecular weight clycoprcteia (PAS-O) eoataining more than 
50% carbohydrate ia O-linkage has been purified from the bmaa milk 
fat globule by Shimiru and leaauchi (83. To see whether this 
'component waa similar to the mucia isolated from milk by affinity 
chromatography ca aa affiaity column, the amino acid 

composition of the purified HK7G-1 reactive mucin waa determined and 
compared to the amino acid eompoaition cf the purified PAS-0 
component. Table 1 .hove that there ia food corre.pondence betweea 
the two eets of data. iadicatine that the core proteina of PAS-0 and 
the mucia purified here are the aaae. 

Isolation of the cor e protein nf the milk B »r<, 

Aa there are no eazyaes eaaily available that are efficieat at 
rem^.v 0-liaked sugars, and p elimination often results in damage to 
the protein core, the cligoeaceharidea were removed by treatment of 
the* mucin with anhydrous hydrogen fluoride. This treataeat has been 
•hown by tort and Lamport (211 to be effective in r.moYiag sugar, from 
Pig submaxillary mucin "without damaging the protein core. Amino acid 
analyeis of the material produced after HP treatment of the milk mucin 
suggested that the protein core waa also in this. case nndaaag.d. siace 
the composition was the same aa that seen in the intact arucin (Table 
1). 



Initially the milk mucin waa exposed to HP for only 1 hour at 4\ 
but analysis of the product showed that there was only partial removal 
of the augers with such treatment, and it was nece.sary to treat the 

smcio at room temperature for 3 hours to obtain a molecule which 
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shoved no lectin binding ability. Figure 3 shows an autoradiograph of 
the iodinated products after treatment for 1 hour at 4* (track 2) or 3 
houra at RT (track 1). It can be seen from figure 5 that the milder 
treatment results in a mixture of products made up of high molecular 
weight material which is slightly scalier than the intact mucin and a 
renter of analler bands. After longer exposure to H? at rooa 
tenperature, the high molecular weight banda disappeared resulting in 
polypeptide bandB of about 65ZD and 72ED. 

To teat for the presence of eugara" oa the inttct mucin and on the 
products produced a^ter the two different KF treatments each 
preparation was subjected to acrylanide gel fclectropherc aia , 
transferred to nitrocellulose paper and reacted v/tb 125 I-1 abelled 
lectins. The lectins used rere peanut lectin (PNA) vhich reacts with 
galactose linked to X-acetyl galectosanine , wheat gtna (*GA) reactive 
with N-aeetyl glucosamine and Helix ponatia agglutinin (EPA) which 
reacts with the linkage sugar in O-linked glycosylation, 
X-acetylgalactosanine . Figure 4 chows autoradiography of the reacted 
biota, and it can be seen that while treatment with K? for 1 hr at 4* 
(track 2) altera the lectin reactivity of the mucin, carbohydrate is 
still preaent. Interestingly, however, there is a much lower level of 
binding of FXA to the high molecular weight material of the partially 
stripped srucin than la seen with the intact mucin (track 1). 
Moreover, this loss in FXA binding ability ii accompanied by binding 
of the linkage sugar specific lectin HP A. This lectin shows so 
binding at all trO the intact mucin, and the changed pattern of lectin 
binding after limited treataent with E? indicates that sugars masking 
the O-linked N-acetylgalactosanine have been stripped off. The 
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.nailer eoepoatat eeen both tie intact =-ocin (track 1) la t*. 
partially .tripped preparation (track 2) i. * glycoprotein which 
react, with ¥OA, although not with FXA. Thie »ay correspond to the 
conponent of .icil tr Secular weight (around 681) ..en after affinity 
chxonatography of the rucin »» d aay represent an intermediate 
precur.orao2.ciae. 

Wgure 4 .hows dearly that the 661 and 721 components produced 
after extensive treatment with KF. (5 fcr at RT) , .how no reactirity 
with the lectins (track 5) , including the X-acetylgalactoearine 
.pacific lectin XPA. This observation constitutes .trong .ridence 
that the eugar. have been renovedf rom at least the najority of the 

colecules , and rill j-ef#r + rt t , . . * 

^ refer to r - 5 Frc^treticn ee the •xtenBirely 

stripped «cia. 

Generation of cor.oclo^l ar.tib a «H.. "e.. the silk sveix, core grctein 

A fusion was carried out using the apleen of a aouse that had been 
i=uni*ed with two injections of the partially .tripped rilk aucin 
followed by a boost with the extensively stripped aucin. She clones 
vere initially .creened against the 1 «I partially .tripped material 
using protein A plates (see Methods). Jour hybridooas were .elected 
and cloned, and table 2 .hows their .pectrun of reactivity with the 
intact, partially and extensively .tripped aucin. As can be .een fron 
thi. table three cf the hybridoaa. which were isolated .bowed a .trong 
reaction with the partially and extensively .tripped aucin and did not 
react with the Intact aucin. These appeared to be good candidates for 
.onoclonal antibodie. to the protein core and two, Bl* and EU-4, were 
•elected to be characterised further. 
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It can also be soen from table 2 that tie B£?G-1 and 
antibodies reacted very .trolly with the mucin .tripped of ite 

arbohydr.te. These two antibodies were, in fact, der.lcped using the 
Intact mncin (fro. the milk fat t lobule) as immunogea and, in the case 
of Ka7G-2. growing mammary .pith.lial cell. (U>. B»ir reaction with 
the .tripped mucin was unexpected, as circumstantial evidence had 
previously led to the belief that carbohydrate might form at least 
part of their antigenic epitopes. 

Koleculer Weight of rnleculee gorg ing gtij Lenic Determinants 

The antibody SM-3 was shown to be of the IgGI .ubcla.s, while the 
antibody was found to be Igli. We therefore cho.e to use the SU-3 
antibody in .ubeequent experiments .ince antibodies of the IgVf class 
_ can present problems in some .ppliction. Immunoprecipitaticn of the 
extensively .tripped material with EJUS .bowed a reaction with the 
lectin unreaetive 6BL component (Figure 5A, track 3). The monoclonal 
antibody HVIFG-2 can also be seen to immune precipitate the 
lectin-unreaetive 681 component (track 2). The antibodies were 
reactive with antigen cn immunoblot. and Figure SB .hows the reaction 
of antibody Si-3 with the dominant 68Z band of the extensively 
•tripped mucin (track 2). 

We hare preriously ahown that the molecular weight of the 
component, in breast cancer cell, carrying determinant, found on the 
«ilk mucin 1. lower than 4001 and can wary from one tumour to another 
(1). Reaction of antibody' ffl-S with' Western blot, of gel ..pirated 
extracts of breast tumour cell, ahows that this antibody resets with 
component, of .imilar molecular weight to those r.actire with antibody 
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KM7G-2 (data not shown). Because the antibody EM-3 differs fro= the 
' antibodies BffG-1 and 2 in that it does not react with the intact 
aucin proee.eed by the lectating gland and yet reacta with aolecules 
proeeased by br.ait cancer cells, it was appropriate to axaaine the 
reaction of SM-3 with a range of br.aat cancers. 

Reactivity of SM-3 with brea st tle» u «>, u a traonra 

The antibody SM-3 reacted with paraffin aabedded tissues provided 
these were fixed in aethaearn (not foraal .aline). Using this aethod 
for preparation of tissue sections, the reaction of the antibody was 
eoapared to that of HM7G-2 on breaet tissues and tuaours with an 
indirect ianunopercxidaae .tainiag aethod. This analysis showed a 
draaatic difference in the staining pattern of SM-3 eoapared to that 
seen with HM7G-2. Thua , although a strong positive reaction was seen 
in 20/22 breast cancere stained with SM-3 (as coopered to 22/22 
stained with HM7G-2). aoraal resting breast, pregnant or lactating 
tissues and aost benign lesions were largely anstained with SM-3 but 
were stained with HMFG-2. Some exaaples of staining patterns of breast 
tissues and tuaoure are illustrted in Figure jB. 

Twenty-two priaary careinoaaa and fourteen benign lesions were 
axaained and the reaction cf SM-3 coopered to the staining with HVffG-2 
la each ease. In the priaary carcinoma*, staining with SM-3 was 
lu.terogen.oua but generally quit, .trong and always confined to tuaour 
cell.; conn.ctiv. tissue and atroma showed no reaction (see figures 
6A.B). In the four f ibroadenoaas ex tained, ■taiaing of the .pitheliua 
with EM7G-2 was strong although heterogeneous. In coaparison, 
atainiz* with SM-3 was negative in one case and in the three ethers 
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staining was confined to only one or two glandular elements. HCG- 2 
shoved strong positivity on the fire papillomas and fire eases of 
cystic disease studied while the staining observed with SU-3 was yery 
much weaker and more heterogeneoua (figures 6G.H). The papillomas ae 
. a rroup showed the strongest staining with SM-3, and it can be seen 
that the staining was membranous or extracellular. 

In contrast to and 1DCG-2 which strongly stain lactating 

and pregnant breast. SM-3 was totally negative with thr.e out of six 
caaea of pregnant or lactating breast (see figure 6C sad D). Two 
positive cases shoved only very weak staining of an occasional cell • ■ 
and in the third, staining was confined to two areas of one lobule, 
/gain, in contrast to Hk£?G-1 and HV£?C-2 which do react with some 
terminal ductal lobular units of normal, resting breast (albeit 
weakly), SH-3 was totally negative on sight out of the thirteen cas^ 
tested and in the other five cases staining was extremely weak and 
often confined to one or two acini in the/tissue section (see figure 
6Z snd ?). It should perhaps be noted that the intensity of staining 
with HCFG-2 seen with normal breast tissues and benign lesions fixed 
in methacarn was somewhat higher than that reported preriously using 
formalin fixed material (50,43. 

SM-3 was also shown to be negative on sections of normal liver, 
lung, thymus, sweat gland, spididymua, prostate, bladder, small 
intestine, large intestine, appendix, thyroid snd skin. lie sntibody 
showed weak positire staining only with the distal tubules cf the 
kidney, the occasional chief cell of the stomach, the occasional duct 
cell of the salivary gland snd the sebaceous glsnd. 
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large nolecular weight aucia noleculea are expressed by saay 
carcinoaae and carry a any of the traonr asaociated antigenic 
deteraiaanta raeogaiaed by aonoclonal antibodies. 3hese epitopes cay 
also be expressed by soae aoraal epitheliua, and aeae aonoclonal 
aatibediea like react particularly wis 11 with a srucin found in 

noraal buaan cilk (1,17). JLm long as the atudy of the srueins i» 
reatricted to their detection with antibodies reactive with tmdefined 
epitopes, the knowledge or their structure, arpreaaica aad prcc tEB iag 
will also be reatricted. We hare begun to investigate the structure 
aad expression of the aa=ary nucin by isolating tbe core protein and 
developing antibodies which have allowed us to aelect partial cDNA 
clones for the gene codiag for the core protein . _ This Example 
describes the production and characteriiatioa of theae antibodies. 

Treetaent of the affinity purified silk aucin with hydrogen 

fluoride resulted in the appearance of a doninant band of about 651 
daltonB.aad a ainor species of about 72ED oa SDS acrylaaide gels. 
These banda shoved no reactivity with lectins, including Helix' ponatia 
agglutinin which is specific for K-acetyl galactoaaaine f tbe first 
sugar ia 0-liaked glycosylate (55). it therefore seena probable 
that this 6BI dalton polypeptide repreaenta the core protein of the 
srucin. Supportive evidence for this coaea frea the observatioa that 
the antibodies described here, which are reactive with the stripped 
6BI coaponent, can precipitate a aoleeule of this site frca the in 
ritro translation products of aRXA isolated froa breast cancer cells 
expressing tbe aucin . 
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As the ailk aucin contains at least 50* carbohydrate ( 16 ) , a 
protein core of only 68E> appear* too aaall if the intact aolecule has 
an obierred aolecular weight greater than 40023). Hovever, aucins can 
b coopoaed of aaall subunits which aggregate and are held together by 
a one form of aon-coralent interaction, aa yet not tmderatood. ? or 
•xaaple, although the aolecular weight of the ovine aubaaxillary aucin 
haa been reported to be greater than 1 HO 6 daltoni (4-50, it haa a 
protein eore of only 650 aaiao aeida with a aolecular weight of 58,500 
daltone fc_e ) . 

An rmerpected finding was that the antibodiea H2CG-1 and 
which react with the ailk aucin, aleo ahow a positive reaction with ■ 
the extensively atripped aaterial which ahowed no lectin binding 
capability. Previous indirect evidence, including the reaiatanee to 
fixation, boiling and reduction, the repetitive nature of their 
epitopes and the appearance of aeveral bands on iaaunoblots, had led 
to the belief that carbohydrate present on the ailk aucin was involved 
in these epitopes. This idea was reinforced by the observation that 
lectins could block the binding of JOO'G-I and 2 (1). Ihile it is not 
possible to exclude the possibility that some augars. nndetected by 
the lectin binding experinents , remain on the extenaively atripped 
aucin described here, this is nnlikely to be the explanation for the 
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r.sctivity.of the aatibodi.e and 2. &i. can b. .aid .lace 

both antibodi.a have recently ba.a aho»a to p.act positively with 
t-talacto.idas. fusion prctaia. axpr.aa.d by pha<e carryia* DKA codin* 
for tha cor. protaia of tha .aaaary snacia . It app.ar. tb.r.for. that 
at l.ut part of aach of tha spitop.a recotais.d by Ek7G-1 aad *!?G-2 
coataia aaiao acida bat It «u.t ba a.auaed that soae of tha.a epitopas 
ca tha cora protaia ar. expoe.d. i.e. aot .aak.d ±a tha fully 
flyoo.ylatad .ol.cula. a, .pitope 1. howar.r laaa abuadaat on 

the .ilk aucia thaa tha HUFG-1 .pitopa, while It i 8 raadily detectable 
on the aucia aol.culea expreaaed by tuaour. (1); Ta.aa »ol.culaa have 
a sxall.r aol.cular w.l t ht aad aay b. less heavily dycoaylated or 
polyaerlred. 

Hare wa hare reported the development of a.w- antibodi a. which are " 
reactiv. with tha protaia core of the tt ueia aad with th. partially 
deglycoaylated aolecule, but which ar. anre active with the fully 
proc.a.ed aucia produced by the lactatin* aaaaary .land. Oae of th.se 
antibodies SU-3. which 1. an 2,01. ha. bean .tudied la acre datail. 
It ha. been shown to react with th. aucia aol.cul.a which ar. produced 
by br.aat cancer call, and are receded by aany antibodi.a d.v.lcp.d 
Kein.t th. Intact ail* »ueia. It .hould b. aaphasis.d how.v.r that 
th. epitope reco C ni..d by which 1. on th. cor. protaia aad 1. 

•xpo.ad ia the aucia aa proc.a.ed by tuaour call.. 1. aot .xpo.* ca 
th. aoraally proceed ailk aucia. Jais feature offer, the 
Poaaibility of enhanced tuaour specificity, and a pilot 
i^unohi.toch.aical study of breaat taaour. aad ti.su.a ha. shown that 
indeed th. EU-3 antibody r.act. strongly with th. aajority of priaary 
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breaat cancers (91*) but ahowa little or so reaction with benign 
breaat tnaoura . reating or lactating breaat. and coat normal tiaauea. 

There are aeveral iaplicationa of the work de.cribed here which 
■ay be important for both baaic and clinical studiea in breaat cancer. 
The obaerration that parte of the core protein (detectable by 
aatibodiea) are expoaed on the aucina aa proeeaaed by breaat cancer 
bet maaked on the mucin aa proeeaaed by cell. In nonaal breaat and 
benign tnaoura impliea that there ia an alteration in the proeeaaing 
of the auein in malignancy, a .ore detailed atudy of the proeeaaing 
of the mucin in no real and malignant cella may -then give baaic 
information for defining the malignant cell. Moreover, .ince the 
specificity of the reaction of the antibody SW-3 for tumour* is better 
than that of antibodiea developed againat the intact auein, .thia- 
antibody may prove to be a more effective diagnostic tool for the 
detection of breaat cancer cella in tiaaue sections, tissue fluids and 
cellB. The reactive coaponenta are aenbrane asaociated aa well 
intracellular and in vivo localiaation of tumoura may also be 
possible. 



as 



She aibrerUtions- naed are; BCC, human fat flobule; PBS, 

phoapbata-bufferad .aline (153 alt Had, 5 wM XCL, 10 a* Ka.HPO , 2 a* 
rU 2 P0 4 pH 7 ' 4 > J WGA ' f«» agglutinin; PXA, paanut a* B lutinin; 

HPA. Bflllx pomatia agglutinin ; BSA, boTine serum albumin; SDS. aodium 
dodacyl aulfate. • • 
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Exanrole 2 

Purification and deglycosyla±ion..of hiuian milk mucin was 
conducted as in Example 1 mucin was purified on an HMFG-1 
antibody. 

She stripped mucin preparations were separated by electrophoresis 
through KaDodSO^/polyacrylamide g els (10*) and silver stained by two 
methods, one of which can be rued to stain highly glycosylated 
proteins (22,23). 

Preparation of polyclonal rabbi t antjjerua to stripped core prctpin 

One Kew Zealand White rabbit was iaaunized with 100 |xg of the 
partially stripped -core "protein in complete Preund's adjuvant (Gibco). 
Booster injections of 500 »g of the totally stripped core protein were 
administered in incomplete Preund's adjuvant (Gibco) 5 and 4 weeks 
after the initial injection and the rabbit was bled one week later. 
Ten microliters of immune serua (75 |ig/al protein) precipitated 200 ng 
of fully stripped core protein in a Protein A assay (24) and detected 
it on iaaunoblots. The immunoglobulin fractions of rabbit preiaaune 
and rabbit anti-sueia eere protein were prepared by adding ammonium 
sulfate to 50* saturation. The resulting pellst was resuspended in 
one-half the original serum relume of PBS and dialyxed against the 
same buffer. After dialysis . only residual precipitate was removed by 
centrifugation. Iaaunoglobulin fractions we.-e stored in aliquots at 
-20 'C. 
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Description of kAbs uaed 

In addition to the polyclonal antiserum, used for initial screening 
a cocktail of two MAbs, SM-3 * nd Exairple 1) which recognise the 

sraein core protein (20) and BtfG-1 and HK7G-2 (1 .14) vers used to 
screen the purified plaques, the p-f alactoiid&se fusion proteins and 
for imaunoprecipitatione from in Titro translated proteins*. Other 
MAb« used vera a aonoclonal anti-p^c alactoaidase antibody (25) which 
n * * C 1 ** from H. Durbin (ICR?, London), an anti -interferon antibody, 
ST254 (24), LE61, a keratin antibody (26) and U1 8 which raeocnii.es a 
carbohydrate structure on the cilk aucin (27). 

In Titro translation of proteins 

RJJA was isolated from the huaan breast cancer cell line MC7-7~~ 
usinc the tnianidiun ieothiocyanate aethod of Chircwin et al. (28) and 
poly(A) + RXA was purified by chromatography uainc olico (dT)-celiulose 
( Ns> ~ »»Cl«nd Bio Labs). The poly(A) + RKA vas translated in a 
reticulocyte lysate eystem (Aaerahaa) in the preienee of [ 55 SJ 
aethionine (1 000 Ci/=ole; 1 Ci « 57 GBq, Aaerahaa). Saaplea 
containinc 5x10* acid insoluble cpa vers precipitated in a protein A 
assay (24) usinc KAbs SU-3, SU-4, HiCFG-1 , HU7G-2 and a control 
antibody to Ixuaan interferon. She antibody-selected proteins were 
then separated on a 10$ XaDodSO^olyaerylsaide c«l» iaprecnated with 
Amplify (Aaerehaa) and exposed to 2UU5 fila (lodak) at -70 *C. 



The KAbs and Elf-4 (SU rmUru to stripped srucin) show stronc 

reactirity with the partially and fully stripped core protein but no 
reactivity with the fully clycosylated mucin (20). 
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Ar.tibody screening of Xgtn library and protein blotting 

The 1ft 11 library u B ed la this study was constructed fro= aFJiA 
isolated from the human breast cancer cell line HC7-7 and was 
generously proyided by Philippe Walter and Pierre Chaabon (Strasbourg, 
Prance). The poly (A) + RHA used for the preparation of the randomly 
primed library was prepared froa mRXA that sedimented faster t*«n 28S 
rRKA and was enriched in estrogen receptor (29). She library was Bade 
essentially as described by Euynh et al. and loung and Davis (50-32) 
and contained approximately 1 HO 6 recombinants per fig of RNA. 
Between 85* and 95% of the plaques contained inserts. 

The phage library was plated onto bacterial strain 11090 and grown 
for 3 hr at 42 "C. After isopropyl p-D-thiogalaetoside (IPTG) 
induction and 3 hr of growth at 37'C, filters wera prepared from each 
plate and screened with an ti -mucin core protein antibody by the method 
of Xoung and Davis (32). The first antibody used in screening was the 
rabbit antiserum raised against the stripped core protein prepared as 
described above. Prior to use in ■ereening, the antiserum was diluted 
1:200 in PBS containing 1% bovine serum albumin (PBS/BSA) . 
Preaisorption with 11090 bacterial lysate was not found to be 
necessary. The nitrocellulose filtere (Schleicher and Schuell) were 
blocked by incubation in PBS containing 5* BSA for 1 hr at room 
temperature with gentle agitation. The filter* were incubated at room 
temperature overnight with a 1:200 dilution of antiserum in heat 
scaled plastic bags. The filters were washed 5x5 ain in PBS/BSA, 
and bound antibody was detected by using horseradish 

peroxidase-conjugated sheep anti-rabbit antiserum (Dako) diluted 1:500 ■ 
with PBS/BSA and incubated for 2 hr with the filters. The filters 
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were washed 5x5 min in PBS/BSA and 1 i 10 min in PBS before color 
detection using 4-ehloro-1 -naphthol (1). IcaunoreactiTe bacteriophage 
were picked and purified through two additional rounds of •creaning. 
Subsequently, bacteriophage Inserts were subcloned into the EcoRI 
• ites of pUC8 (S3) producing the plasnid used cost extensively. pWJC 
10. The plead da were maintained in DH1 cells. 

To examine the B-galactosidaae-cDNA fusion proteins for 
immunoreaetivity, cell lysates ware derived. Imogens nre prepared 
as described in Toung and Davis (34). Cells were pelleted, suspended 
in Laemmli sample buffer (35) and separated by electrophoresis through 
KaDodS0 4 /polyacrylanide gels (10%) and transferred onto nitrocellulose 
filters as described (1,36). The filters were treated as above for 
antibody screening. 

Northern Analysis 

RJiA was isolated fron tissue culture cells and frozen tissues by 
the guanidiniua isothiocyanate method of Chirgwin et al. (28). Total 
UNA (10 pg per lane) was denatured by beating at 55'C for 1 hr in 
deionixed glyoxal and fractionated by electrophoresis through & 1.355 
glyoxal gel (38). Bae RNA was transferred to nitrocellulose 
(Schleicher and Schuell), prehybridited and hybridised as described by 
Thomas (34). Filters ware washed down to 0.11 SSC with 0.1 EDS at 
65'C and sxposed to XAR-5 film (lodak) at -70'C with intensifying 
screens. 
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Southern analysis 

s High molecular weight genomic DNA was prepared from white blood 
cells and cell lines (39,40). These genomic TNjLs (10yg) were cleaved 
with restriction anzymea following the manufacturers recommended 
conditions and fraction ed through 0.6% and 0.7* agarose gels. Cloned 
plaamid DNA was cleaved and fractionated on I.SJJ agarose. The gels 
were denatured, neutralized and transferred to nylon membranes 
(Biodyne) according to the manufacturer's instructions. The EcoR1 
Insert from pKUCl 0 was separated on a 1J5 low Belting point agarose 
(Biorad) gel and labelled with [ a - 52 P]dCTP b y the method of ' random 
priming (41) and hybridized to filters at 42 # C. Filters were washed 
down to 0.11 SSC with 0.1% SDS at 55 # C and exposed to ZAR-5 film 
(Kodak) at -70"C with intensifying screens. 



Results 



Purification a nd deglvcosylation of mucin glycoprotein 

Mucin glycoprotein reactive witk the monoclonal antibody H1£PG-1 
was prepared from pooled human breast milk by using an H1£?G-1 antibody 
affinity column, followed by molecular sieve chromatography on 
Sephadex G-75 in order to remove lower molecular weight components 
(figure 7. lane 1). In order to demonstrate the homogeneity of the 
purified molecule, amino acid analyses of four separate preparations 
were performed and revealed a fairly consistent composition with 
serine, threonine, proline, alanine and glycine accounting for 58% of 
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the a=ino acids ( Periodic acid ailver .t&ined gels 

* 

revealed a diffuse band of greater than 400,000 daltona risible only 
when the gel was treated with periodic acid before the ailver stain 
(Wji 7* lane 2). Vo other lower molecular weight bands were 
wisualixed on the gel using the ailver etain without prior treataent 
with periodic acid. 

The purified material was subjected to treatment with hydrogen 
fluoride to remove the O-linked sugars that are characteristic of 
mucin glycoproteins. Two different reaction conditions were uaed 
which resulted in a partially deglycosylated core protein (treated at 
0 # C for 1 hr) and a fully deglycosylated core protein (treated at roo: 
temperature for 3 hr) as determined by iodinated lectin binding 
.following separation by gel electrophoresis and transfer to 
nitrocellulose paper (20). The partially deglycosylated core protein 
was reactive with wheat germ agglutinin, peanut agglutinin and helix 
pomnatia lectin (which recogni2eB the linkage sugar 
X-acetylgalactosaaine) whereas the fully stripped protein ahowed no 
reactivity with any of these three lectins. 

Ihe hydrogen fluoride treated core protein wae aeparated by 
electrophoresis through KaDodSO^/polyacrylaaide gels (10?) and ailver 
stained. Silver staining revealed that the prador-i^ant component of 
the partially stripped mucin was a high molecular weight band of about 
400 id, although faint bands of lower aolecular weight could also be 
observed (Fig.B , lane 1). Since the high molecular weight material 
ahowed a somewhat increased mobility in the gel and reacted with the 
lectin recognising the linkage sugar, it can be assumed tbat aome 
sugars had been removed. The fully stripped mucin consisted of two 
bands of about 68 led and 72 kd (Fig. 8 f 2). 
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Antibody reactive proteins produced by VfCF-7 cellB 

The kCF-7 breast cancer cell line expresses large amounts of 
aCFG-l and -2 reactire material on its cell surface (14) and was thus 
Judged to be a suitable source of mRKA for a cDNA library. Before 
proceeding to screen the MCF-7 library with the aonoclonal antibodies, 
they were tested for their ability to precipitate a component from in 
yitro translation products produced from KCF-7 ttRKA. Poly (A) + RNA 
from HCF-7 was prepared and translated in vitro . Proteins from the 
translation reaction were immunoprecipitated' uaing the monoclonal 
antibodies f »«£7G-2, SM-3 and and displayed by 

polyacrylamide .gel electrophoresis and fluorography (Fig, 9). Two . 
proteins of about 68 kd and 92 kd were icaunoprecipitated by SM-3 
(lane 2) and (lane 1) # It was also found that HL£TG-1 (lane 4) 

and Hfc£7G-2 (lane S) immunoprecipitated these proteins; however, no 
bands in these areas were precipitated by an irrelevant monoclonal 
antibody to human interferon (lane 5). The fact that HH7G-1 and -2 
imaunoprecipitated these proteins was an unexpected finding as it was 
previously thought that these HAbs recognixe carbohydrate 
determinants (1), However, we also found that HMFG-1 and -2 react 
very strongly with the fully stripped, iodinated core protein (20). 
These results together" with the l£Ab reactions on the p-gal*ctosidase 
fusion proteins (see below) confirm that the epitopes for HUFG-1 and 
-2 are,, at least in part, protein in nature. 

The abundance of the core protein rnRXA in total cellular poly (A)* 
JltfA was 4% as estimated by comparing the amount of ( !5 S)methionine 
present as itmaunoprecipitated protein to the amount of methionine 
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incorporated into total protein during in vitro translation. 

Be reer.ing of the cDKA library 

The Xgt11 cDKA library Bade froa size selected MC7-7 aRKA (see 
Methods) was screened initially with the polyclonal antiserua made to 
the mucin core protein which had been stripped of its carbohydrate. 
Screening of 2 i 1 0 6 plaques resulted in 11 positive clones, 7 of 
which were taken successfully through two further rounds of plaque 
purification. 

To deaonstrate that the reactivity of the phage clones with the 
antibody probes was due to antigenic deterainants on the cDNA 
translation product, p-galactosidase fusion proteins were_made froa 
all 7 clones. The proteins were separated by electrophoresis, 
transferred to nitrocellulose paper and probed with a variety of 
antibodies to the stripped mucin, including the polyclonal antiserua 
which was used initially to select the clones and a cocktail of SM-3 
and SH-4. In addition, K£?G-1 and HHPG-2, the two monoclonal 
antibodies which originally detected this differentiation and 
-tuaour-SBSociated epithelial nucin (1,14) were tested. All 7 clones 
yielded fusion proteins which were specifically recognized by the 
polyclonal antiserua, the aonoclonal cocktail, and Ifl£?0-2. Ifl£?G-1 
antibody reacted with 6 of the 7 fusion proteins and failed to 
recognise the protein from clone 9 which contains the smallest insert. 
In every case the strongest signal was given by the H)£FG-2 antibody 
and this reaction is shown in Figure 10. Monoclonal antibodies to 
keratinB and to a carbohydrate epitope on this fully glycosylated 
mucin were used as controls and shoved no reactivity 
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A monoclonal antibody to p-galactosidase was a positive control and 
the band recognized correlated In every case with the band recognized 
by the specific antibodies. The sizes of the fusion proteins varied 
in proportion to the sizes of tie cDKA inserts found in the 
bacteriophage. 

Characterization of cDNAs and RKA blot analysis 

The inserts from the X clones were designated p!TUC3-10- (omitting 
pMUC5) and were subcloned into the rector pUC B for easier 
manipulation. The 7 clones were compared to each" other for sequence 
homology.. Each of the plasxids was digested with EcoRI and the insert 
separated on a 1.4% agarose gel. The largest cDKA insert from pMUClO 
was used to probe the inserts and found to hybridize to all 6 inserts 
(Pig* 11 ). P^UC 7 was found to contain two inserts following digestion 
with EcoRI; however, only 1 of the inserts hybridized to the pHUCIO 
probe. The insert bands were not derived from phage DNA aince the 
pMUCIO probe did not hybridize to Hind Ill-digested X phage DNA 

As ahown by agarose gel electrophoresis (Pig. 11) f the 
inserts vary in aize from about 200 to up to about 1600 bp. The 
largest insert from pirtJCIO has been used as the hybridization probe in 
all subsequent experiments. 

Because the JLWJC clones were identified only by. antibody binding, 
we needed additional assurance that they were indeed coding for the 
breast epithelial mucin. To determine the authenticity cf plfUCIO, we 
correlated the presence of aRHA hybridizing to the clone with mucin 
expression in various cell lines. As shown in figure 1%, the cDKA 
hybridized to two transcripts of 4,7 kb and 6.4 Id? in the RNA from the 
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breast cancer cell lineB MC7-7 and T4 7D which wore shown previously to 
express the HkJ?G-2 antigen (1,14). Significantly, the pUUClO probe 
hybridized. to transcripts of approximately the same aire in-RMA 
•xtracted from normal mammary epithelial cells cultured from milk 
(42). A third band of 5.7kb can be sees in the UNA from these normal 
cells. la contrast, three human cell types that lack the mucin, 
breast fibroblasts, Daudi cells and HS578T, a carcinosarcoma line 
derived from breast tissue (43), ahowed no detectable plfUCI 0-related 
jaRNA. The 6.4 kb band appears to be the most adundantly expressed. 
The presence of at least two sizes of mRKA f rom -UCF-7 cells correlates 
with the inmuaoprecipitation of two proteins of (molecular weights 65 
kd and 92 kd) from in vitro translated mRHA from UCF-7 cells. The 
normal, mammary epithelial cells were derived from pooled milk aamples 
and the additional transcript observed may be due to polymorphisms 
among individuals. 

Genomic PKA blot hybridiza tion and detection of a restriction fragment 

length polymorphism CKFLP) 

Genomic DNA was prepared from a panel of ten individuals 
consisting of six unrelated individuals and a family of four, and from 
three cell lines. The UN As which were digested with Hinfl or EcoRI 
and blotted and hybridized to the radiolabelled plfiJCIO insert, exhibit 
restriction fragment length polymorphisms. The .restriction fragments 
from the ten individuals and three cell lines are shown in figure 13. 
The pattern consists of either a aingle band, or a doublet of sizes 
ranging from S400bp to 6200bp in the Hinfl digest (with the exception 
of the ZR75-1 X>NA in lane 12, figure 13A»hich shows three bands) or 



SUBSTITUTE SHEKT 



WO 88/05054 





PCT/GB88/00011 



-46 



fron 8200bp to 9600bp in the EcoRI digest (Figure 13 B ) . There appears 
to be a continuous distribution of the fragment sizes which iopliea a 
high in viTO instability at the locua. Ine pattern of fragments 
observed in the family of four (lanes 1-4) suggests that these 
fragments are allelic. Preliminary studies of the DNA made from white 
blood cells of normal, related individuals indicate the sxistance of a 
number of independent alleles with an autosomal codominant mode of 
Inheritance JhBB0 
studies will be the subject of a separate investigation. 



The cDNA clones described here which were obtained from the 
lgt11 library were selected -using polyclonal and monoclonal antibodies 
prepared against a normal cellular product, the milk mucin in its 
deglycosylated form. This was done because it was easier to obtain 
large quantities of the mucin for stripping than to.prepare similar 
quantities of immunologically related glycoproteins expressed by 
breast cancer cells (44). The fact that the antibodies did select for 
cDNA coding for nonglycosylated core protein molecules in MCF-7 cells, 
strongly suggests that the glycoproteins in these cells, which were 
originally detected by their reaction with antibodies to the milk 
mucin, contain the same core protein aa this mucin. This is confirmed 
by the detection of aRNAa of approximately the same sizes in the 
normal and malignant cells, using one of the probes isolated from the 
UCP-7 library. We will therefore refer to the antibody reactive 
glycoproteins on breaBt cancer cells as mucins, bearing in mind that 
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their processing Bay be different resulting in molecules of different 
nolecular weights but with the same' core protein as that of the milk 
mucin . 

Seven clones were obtained fron the MCF-7 library of which the 
largest was ISOOrb. This clone cross hybridized with the other 6 
•mailer clones. The p-galactosidase fusion proteins expressed by six 
of the cross-hybridizing laabda clones were reactive with the 
polyclonal, antiserum directed against the mucin core protein as well 
as with four well-characterized monoclonal antibodies directed to 
various epitopes on the stripped core protein, SH-3. SM-4, MffG-1 and 
H^G-2 (14,20). The smallest laabda clone, 11TJC9, produced a 
p-galactosidase fusion protein which reacted with three of the four 
monoclonal antibodies and with the polyclonal antiserum. 

. The surprising result that the extensively characterized fa£?G-1 
and JO£FG-2 monoclonal antibodies reacted strongly with the lambda 
plaques and the fusion proteins and could iamunoprecipitate proteins 
from in vitro translated aRNA provides strong evidence that these 
clones do indeed code for a portion of the mucin core protein. 
Although previous evidence eucb as resistance to fixation, boiling, 
treatment with dithiothreitol and KaDodSO^ and the presence of ' 
multiple epitopes on the molecule suggested that. these were 
carbohydrate (1 ) , it has row been established that the epitopes cf the 
»£7G-1 and H£TG-2 monoclonal antibodies are definitely protein in 
nature. Carbohydrate may be required to obtain the. etrongeat binding, 
either aa part of the epitope or by conferring some conformational 
change on the protein portion, but part of the antigenic determinant 
must consist of an amino acid sequence. Since these two JiAbs are 



VZSTiTUTZ SHEET 



WO 88/05054 * ) '\ PCT/GB88/0001 1 

-48- ' 



reactive with the fully glycosylated Bilk mucin as well as the 
stripped core protein, this data means that the intact molecule 
contains areas of naked peptide which contribute to the antigenic 
sites for these two antibodies. 

Confirmatory evidence that plTUCI 0 codes for the mammary mucin core 
protein is proTided by RJiA blots. The relative abundance of aMA in 
the breast cancer cell lines HC7-7, T47D, ZR-75-1 and in normal 
. »aaaary epithelial cells corresponds to the antigen expression by 
theae cella as meaaured by the binding of the H1£?G-1 and EV£PG-2 
monoclonal antibodies. Cell types which are negative for antigen 
expression such as human fibroblasts. Daudi cells and HS576T, a 
carcinosarcoma line derived from breast (14), are negative in UNA blot 
hybridizations. A fortuitous observation made with the ZR-75-1 
cells yielded indirect strong evidence that pilUCIO does indeed code 
for the oucin glycoprotein core protein. This cell line, which 
routinely expresses large amounts both of nKKA and antigen, yielded 
one preparation of UNA which was unexpectedly negative by blot 
hybridization. It was subsequently found that thoae particular 
ZR-75-1 cells from which the RNA had been made had lost the expression 
of the antigen aa well at this time (as determined by reaction with 
HHFG-1 and 2). Different passage numbers of the ZR-75-1 cells were 
recovered and ahown once again to express both antigen and message. 
She aires of the messages. 4.7 kb and 6.4 kb, are quite large, aince a 
68 kd or 92 Id protein would need cmly about 5 kb to code for the 
protein portions. This suggests that a large portion of the aRXA 
aaybe untranslated. Efforts are underway to obtain a full-length 
clone . 
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Thus, the cDNA clones presented here represent a portion of the 
gene coding for the human mammary mucin which is sxpressed by 
differentiated breast tissue as well as by most breast cancers. The 
aajor proteins precipitsted froa in vitro translation products of RKA 
from UCF-7 cells by antibodies to the ailk mucin core protein (68£d) 
tave an apparent siolecular wight of 68Xd and fi2Xd. Tfcese proteins, 
produced by the breast cancer, cell . therefore share epitopes with the 
6BId core protein of the ailk smcin (20). Whether a similar P 2£d 
protein is also produced by normal mammary epithelial cellB, and is 
truncated or destroyed by KF treatment is not yet clear. HCF-7 cells 
biosynthetically labelled with 14C amino acids yield upon 
immunoprecipitation with and H£7G-2 antibodies, two 

glycosylated proteins of 320 kd and 430 kd, aad it 

is possible that each of these glycoproteins utilises only one core 
protein of either 68Id or 92Zd. Alternatively, each of the 
glycoproteins could contain both the S2Zd and 68Xd proteins either in 
different proportions or variably glycosylated. Further screening of 
the library nay yield full length cDNAs coding for both sizes of the 
immunologically related core proteins. Since there appears to be only 
a single gene (based on Southern blot data obtained by using a partial 
cDNA probe), it is probable that the multiple messages arise by 
alternative UNA splicing and this would explain the fact that they 
contain common sequences. Although a core protein of 68 kd appears to 
be small to yield a fully glycosylated molecule of greater than 500 kd 
which contains 50% carbohydrate, there is evidence tha t such a 
structure for mucins is possible. Ovine submaxillary mucin has a 
reported molecular weight of 1 x 10 6 daltons (45), yet its protein 
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core consists of 650 amino acids resulting in a aolecule of 58 kd 
(46). 

The aucins which are detected with HV£FG-1 and HHFG-2 UAbs on 
isauaoblota of tumours and breast cancer cell lines .how variations in 
sise from 80 kd to 400 kd in the molecular weights of the tumour oucin 
aoleculea (1,47). Uaing these aame antibodiea which detect nigh 
molecular weight mueina present in normal urine, a polyaorphiam has 
indeed been ahown to be genetically determined (48). although the 
very low aoleeular weight eomponenta are likely to repreaent precuraor 
forms of the mucin which appears to be incompletely proceaaed in many 
tuDour cells (20), the variations in the higher molecular weight 
coaponenta are likely to be due to thia genetic polymorphiaa. It was 
unclear, however, whether the atructural basis of tie polymorphiaa was 
due to the genetically determined protein or to the carbohydrate 
portion of the mucin. The detection of restriction fragment length 
polyaorphiams in the Southern blotting eiperiaenta uaing the mucin 
probe suggest that the mucin polyaorphiam eccura at the level of the 
DNA which codes for the protein. Preliminary aequence data 
/suggest that the baaia for thia polyaorphiam ia a region 
of variable tandem repeats present in the protein coding aequencea. 
Thia atructural feature nay be reaponaible for the generation of the 
xaany allelic reatrietion fragaenta at the mucin locua. We are 
presently inveatigating the baaia of the mucin polyaorphiam by a 
Southern blot aurvey of DHA from white blood cella of normal, related 
individuals whoae inheritance pattern of urinary mucins has been 
determined. In addition, we are examining DHA preparations aade from 
the white blood cells and tumours of individual breast cancer patients. 
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to determine if there is any discordance between genotype in the . 
paired samples, since tandesly repeated PNA may provide an unstable 
■ite where recombination or amplification could occur. 

The presence of mucin a in the majority of carcinomas and their 
aa.ociation with the differentiation of mammary epithelial cells makes 
it particularly important to identify regions involved in the tissue 
specific and developmental regulation of the gene. Moreover, the 
introduction of a functional mucin gone into cells should provide 
insights into the role cf this molecule in breast epithelial 
differentiation and possibly enable us to identify any alterations in 
the function or expression of the mucin which are related to malignant 
transformation in the human breast. 
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■Abbreviating ........ 

The abbreviations are as follows: PBS, phoaphate-buf f ered saline • 
KAb, monoclonal antibody I'lFM." isoprop 7 l 0-D-thiogalactoside ; bp. 
base pair(s); lb, kilobaae (s) . 




■4? - 



SUBSTITUTE SHEET 



) 



-53 - 



f L"l/Uli88/UUUll 



TABLE 1 



A=lno ecid eoeooBitlnn gf the huaaj yit ttuetn _ 



eop-parleon ^-ltv. Ptg-n 



Aaino acid 



HWC-1 purified fctea.ir.ly .tripped P AS_o 

1882) 



.Asp 
Thr 
Ser 
Glx 



Hi. 



6.1 
9.4 
9.1 
6.3 



2.0 
3.2 
2.6 
4.0 



7.2 
9.7 
13.0 
9^6 



Pro 


14.8 


14.4 


Gly 


8.1 


10.1 


Ala. 


12.3 


11.9 


Cya 


Not analysed 


Not analysed 


Tal 


6.0 


6.3 


Met 


0.5 


0.4 


He 


1.6 


1.7 


Leu 


4.5 


4.8 


3yr 


2.0 


0.9 



1.6 
2.5 
S.S 
4.0 



6.4 
9.8 
15.1 
8.5 
12.0 
12.2 
13.0 
0.5 
5.5 
0.8 
1.9 
5.7 
1.6 
1.7 
5.8 
2.2 
3.9 
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Sable g 

Reactivity of the antibody p. lntaet . p , rf <. 1lT .„ 



-Antibody 



125 



lntaet aolecule 



1 cpo bound 



Partially Totally 
stripped aucla stripped xcmcin 



5.17 


6,524 


11,925 


5,780 


9.13 


525 


3,000 


3,328 


Eii-3 


465 


15,414 


. 9,200 


SM-4 


816 


16,750 


9,561 


HUFG-1 


52,000 


33,768 


9,494 


HW7G-2 


29,500 


29,230 


15,832 


KS2 aediua 


397 


645 


650 



The blndiBB of the antibodie. to iodinated Intact, partially and 
totally de^lycoeylated mil* wcia *ae „ 8ayed „ ia£ the A 
plate aethod aa deacribed in Material, and Methods. 
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